L . Moench , O . Rose , eds . ON STEP SIZES , STOCHASTIC SHORTEST PATHS , AND SURVIVAL PROBABILITIES IN REINFORCEMENT LEARNING

نویسندگان

  • S. J. Mason
  • R. R. Hill
  • L. Moench
  • O. Rose
  • Abhijit Gosavi
چکیده

Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the problems have a very large number of states. We present an empirical study of (i) the effect of step-sizes (learning rules) in the convergence of RL algorithms, (ii) stochastic shortest paths in solving average reward problems via RL, and (iii) the notion of survival probabilities (downside risk) in RL. We also study the impact of step sizes when function approximation is combined with RL. Our experiments yield some interesting insights that will be useful in practice when RL algorithms are implemented within simulators.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient Method for Selecting a Reliable Path under Uncertainty Conditions

In a network that has the potential to block some paths, choosing a reliable path, so that its survival probability is high, is an important and practical issue. The importance of this issue is very considerable in critical situations such as natural disasters, floods and earthquakes. In the case of the reliable path, survival or blocking of each arc on a network in critical situations is an un...

متن کامل

Predisaster Preparation of Transportation Networks

We develop a new approach for a pre-disaster planning problem which consists in computing an optimal investment plan to strengthen a transportation network, given that a future disaster probabilistically destroys links in the network. We show how the problem can be formulated as a non-linear integer program and devise an AI algorithm to solve it. In particular, we introduce a new type of extrem...

متن کامل

Intersection properties of Brownian paths

This review presents a modern approach to intersections of Brownian paths. It exploits the fundamental link between intersection properties and percolation processes on trees. More precisely, a Brownians path is intersect-equivalent to certain fractal percolation. It means that the intersection probabilities of Brownian paths can be estimated up to constant factors by survival probabilities of ...

متن کامل

Evaluation of Multiagent Search Performance Revised Proposal & Midterm Review

Pathfinding the simple process of finding a route from one point to another comes with ease to humans as well as animals and is as essential to survival as is to convenience. On the other hand, it is a remarkably difficult task to replicate in the artificial world. Because it is essential to numerous technological applications, notably autonomous locomotion of mobile robots, movement of agents ...

متن کامل

k-Survivability: Diversity and Survival of Expendable Robots

We define the k-survivability of a set of n paths as the probability that at least k out of n robots following those paths through a stochastic threat environment reach goals. High k-survivability sets tend to contain short and diverse paths. Finding sets of paths with maximum k-survivability is NPhard. We design two algorithms: a complete algorithm that finds an optimal list of paths, and a he...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009